Demo of iMAG Possibilities: MT-postediting, Translation Quality Evaluation, Parallel Corpus Production

نویسندگان

Ling Xiao Wang

Ying Zhang

Christian Boitet

Valérie Bellynck

چکیده

An interactive Multilingual Access Gateway (iMAG) dedicated to a web site S (iMAG-S) is a good tool to make S accessible in many languages immediately and without editorial responsibility. Visitors of S as well as paid or unpaid post-editors and moderators contribute to the continuous and incremental improvement of the most important textual segments, and eventually of all. Pre-translations are produced by one or more free MT systems. Continuous use since 2008 on many web sites and for several access languages shows that a quality comparable to that of a first draft by junior professional translators is obtained in about 40% of the (human) time, sometimes less. There are two interesting side effects obtainable without any added cost: iMAGs can be used to produce high-quality parallel corpora and to set up a permanent task-based evaluation of multiple MT systems. We will demonstrate (1) the multilingual access to a web site, with online postediting of MT results "à la Google", (2) postediting in "advanced mode", using SECTra_w as a back-end, enabling online comparison of MT systems, (3) task-oriented built-in evaluation (postediting time), and (4) application to a large web site to get a trilingual parallel corpus where each segment has a reliability level and a quality score.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

iMAG : MT-postediting, translation quality evaluation and parallel corpus production (iMAG : post-édition, évaluation de qualité de TA et production d'un corpus parallèle) [in French]

متن کامل

Online production of HQ parallel corpora and permanent task-based evaluation of multiple MT systems: both can be obtained through iMAGs with no added cost

متن کامل

The taraXÜ corpus of human-annotated machine translations

Human translators are the key to evaluating machine translation (MT) quality and also to addressing the so far unanswered question when and how to use MT in professional translation workflows. This paper describes the corpus developed as a result of a detailed large scale human evaluation consisting of three tightly connected tasks: ranking, error classification and post-editing.

متن کامل

A Web Service Enabling Gradable Post-edition of Pre-translations Pro- duced by Existing Translation Tools: Practical Use to Provide High- quality Translation of an Online Encyclopedia

SECTra_w is a Web-based system offering several services, such as supporting MT evaluation campaigns and online post-editing of MT results, to produce reference translations adapted to classical MT systems not built by machine learning from a parallel corpus. The service we are interested in here is the possibility for its users to import a document, or a set of documents (not only a list of pr...

متن کامل

Multi-Engine and Multi-Alignment Based Automatic Post-Editing and its Impact on Translation Productivity

In this paper we combine two strands of machine translation (MT) research: automatic postediting (APE) and multi-engine (system combination) MT. APE systems learn a target-languageside second stage MT system from the data produced by human corrected output of a first stage MT system, to improve the output of the first stage MT in what is essentially a sequential MT system combination architectu...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Demo of iMAG Possibilities: MT-postediting, Translation Quality Evaluation, Parallel Corpus Production

نویسندگان

چکیده

منابع مشابه

iMAG : MT-postediting, translation quality evaluation and parallel corpus production (iMAG : post-édition, évaluation de qualité de TA et production d'un corpus parallèle) [in French]

Online production of HQ parallel corpora and permanent task-based evaluation of multiple MT systems: both can be obtained through iMAGs with no added cost

The taraXÜ corpus of human-annotated machine translations

A Web Service Enabling Gradable Post-edition of Pre-translations Pro- duced by Existing Translation Tools: Practical Use to Provide High- quality Translation of an Online Encyclopedia

Multi-Engine and Multi-Alignment Based Automatic Post-Editing and its Impact on Translation Productivity

عنوان ژورنال:

اشتراک گذاری